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Introduction 

D'.rlng 1968-69 ten communities throughout the United States cooperated 
with the Far West Laboratory for Educational Research and Development to test 
a model Follow Through Program In kindergarten and first grade. The model, 
called the Responsive Follow Through Program, has two major objectives. 

1. To help children develop a positive self-image, and 

2. To help children develop their Intellectual abilities. 

This paper addresses Itself to the second objective and describes the 
testing program and Instruments used to measure changes In Intellectual ability 
of eight kindergarten and two first grade Follow Through classes during the 
1968-69 school year. First, the testing program Is described. Test results 
for each comnuntty are then presented and discussed. This Is followed by a 
general summary and discussion. Finally, limitations of the testing procedures 
and plans for further analysis of the data are presented. 

Before describing the methods used to measure Intellectual development 
In kindergarten and first gride children, efforts to evaluate the Responsive 
Follow Through model should be put Into perspective. 

Educators and researchers tend to measure the effectiveness of an 
educational system or model by pointing to gains In objective test scores. 
Emphasis Is placed on changes made on objective tests for at least two reasons: 

1. The objective testing movement Is so Ingrained In our society that It 
Is natural for an educator, researcher or parent to look solely at 
objective test results as Indicators of a program's success, and 

2. Because of the emphasis on objective testing of cognitive skills, 
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development of ways to measure other educational outcomes have re- 
ceived only minor attention. For example, there is no "widely ac- 
cepted' 1 instrument to measure a child's self-concept, his desire 
for 1 earning , or his attitude toward school. Until instruments are 
developed to measure these "affective" goals, we will continue to 
rely on objective test results for evaluation. 

One other point should be made: The word "Preliminary" is used in the 
title of this report for a special reason. 8ecause of the long-term nature 
of our goals, a one-year assessment of the degree to which the Responsive 
Model achieves these goals cannot be adequate. For this reason, various 
Follow Through model developers, called sponsors, and the National Follow 
Through office have placed a moratorium on between-sponscr, experimental- 
control group comparisons. Comparisons made between sponsors, other than 
describing and contrasting methods, are premature at this time. 

11. The Testing Program 

To measure the development of a kindergarten child’s Intellectual abili- 
ty during his nine month experience In the Responsive Model, a four-test, 
battery reflecting some of the program's objectives was administered to the 
eight Follow Through kindergarten classes In the fall of 1968 and again In 
late Spring of 1969. 

A separate Instrument appropriate for first grade, was administered In 
pre-post fashion to first grade Follow Through classes. Comparison classes, 
selected for similarity to the Follow Through classes, were also pre-and post- 
tested with the same battery. For reasons already discussed, the test results 
of comparison groups are not presented In this paper. 

A. The kindergarten test battery was composed of four Instruments; 

1. A short form of the Wechsler Preschool and Primary Scale of 
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Intelligence (WPPSI) J The WPPSI was designed to measure intelli- 
gence In young children aged four to six and a half. Subtests are 
Individually admlnistred by a trained person. Only four subtests 
were selected for administration because they were the best mea- 
sures of the objectives of the model and of the subtests in the 
WPPSI. They provide the most reliable estimate of intelligence. 

The four subtests that made up the short-form WPPSI were: 

WPPSI Subtest Related Responsive Objective 

a. Vocabulary Language development 

b. Similarities Concept formation 

c. Picture Completion Perceptual acuity 

d. Block Design Problem solving 

The Vocabulary subtest requires the child to answer such ques- 
tions as "What Is a shoe?" and "What does "chisel" mean?" This sub- 
test measures word knowledge and potential for dealing with symbols. 

The Similaritie s subtest measures verbal concept formation by re- 
quiring the child to make verbal associations within such categories 
as clothing* foods and toys. The Vocabulary and Similarities subtests 
combine to yield a Total Verbal Score. 

The Picture Completi on subtest Is a measure of verbal concentration. 
It requires a child to find missing pieces In a picture and to be aware 
of objects In his world. 

The Block Design subtest requires the child to analyse and reproduce 
an abstract design. It also appears to measure Insight Into space re- 
lations. The Picture Completion and Block Design subtests can be com- 
bined to give a score on Total Performance . 
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The combination of the Total Verbal and Total Performance scores 
was used as a measure of a child's intelligence and is referred to as 
the Total WPPSI score. To get the Total WPPSI score, raw scores for 
each of the four subtests were converted to scale scores using differ- 
ent age tables. These four scale scores were then added. Based on 
the standardization sample, the general population of children would 
have an average Total WPPSI score of 40. The standard deviation of 
these scores would be around 10. 

The WPPSI Is a relatively new Instrument and research on its ac- 
curacy Is scant. However, because construction and administration 
procedures parallel procedures used In developing other Wechsler 
scales, the WPPSI enjoys the Wechsler reputation. Wechsler scales 
are reliable and valid for measuring Intelligence and predicting 
scholastic success. 

2 

2. The Caldwell Preschool Inventory (PSI) . The PSI was developed 
to be used with children from three to six years of age to measure 
achievement In areas regarded essential for success In school. The 
author, Bettye M. Caldwell, claims that the test Is not culture-free, 
but was designed to permit educators to highlight the degree of a 
child's Intellectual deficit. The 85-ltcm Instrument yields four ma- 
jor factors plus a total score. The factors accounting for most vari- 
ability are called " Concept-Activation .” The "concepts” are composed 
of ordinal or numerical relations and sensory attributes such as form, 
color, size, shape and motion. The "activation" Involves "being able 
to call on established concepts to describe or compare attributes”*, 
for example, to relate shapes to objects or to "execute motorlcally 




— — 

Caldwell, Bettye M., "The Preschool Inventory, Technical Report", Cooperative 
Test Division - Educational Testing Service, Princeton, N. J., 1967. 
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some kind of spatial concept" such as reproduction of geometric de- 
signs or drawing the human fiqure. The other two factors are Personal 
Social Responsiveness— involving knowledge about the child's own per- 
sonal world--and Associative Vocabulary- -requiring the demonstration 
of awareness of the connotation of a word by carrying out some action. 

Raw Scores are converted to percentiles using norming tables pro- 
vided in the manual. Test reliability reported in the PSI manual is 
scant. A .98 correlation was obtained between the score on the com- 
plete version and score on the shortened version for the original 
standardization sample of 171 children. A reliability correlation of 
.95 (a split-half corrected using the Spearman-Brown formula) was ob- 
tained for the shortened version using scores made by the same standar 
dization sample. 

3. The Innovative Behavior Test (IB). This instrument is one sub- 
test in the Cincinnati Autonomy Test Battery developed by Thomas J. 
Banta . It was designed to measure problem-solving ability In young 
children by having them t»"ace different ways to get a dog to a bone. 
The test battery has not been published; consequently, there is no 
reliability Information available. Test results are reported in raw 
score form and range from 0 to 30. 

4 . The Categories Test ("C" test). This test was internally de- 
veloped to test the ability of young children to make simple classifi- 
cations. For example, the child is handed a flashlight and shown nine 
objects. One of the objects is a light bulb that "goes with" the 
flashlight. The child Is correct if he points to or names the correct 
response. Results are also in raw score form and range from 0 to 9. 




i 

’’Banta, Thomas 0., "Tests for the evaluation of Early Childhood Education: The 
Cincinnati Autonomy Test Battery (CATS)", (In published paper), University of 
Cincinnati, Cincinnati, Ohio 1968. 
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Although the Categories test has been used In other studies, the 

4 

reliability has not been reported . 

B. First Grade Tests 

The test administered to the two first grade classes was the Metro- 
5 

poll tan Readiness Test . 

The Metropolitan Readiness Tests were developed to be administered 
at the end of kindergarten or the beginning of first grade to provide an 
objective reliable measure of readiness to do first grade work. The test 
Is group administered and a child's total score Is made up of Items In 
six areas: 

1. Word meaning - range of Vocabulary and verbal concepts, 

2 . Listening - comprehension of phrases and sentences, sustained 

attention, capacity for inference, 

3 Matching - visual perception of similarities and differences 
In word forms and figures, 

4. Alphabet - ability to recognize lower-class letters of the alDhabe 

by name, 

5. Numbers - number maturity, vocabulary, counting, recognition of 

written numerals, interpreting number symbols, simple 
numerical problems, and 

6. Copying - combination of visual perception and motor control that 

Is Important In learning to write. 

For Interpreting test results the test manual provides five levels of 
"readiness status" as well as percentile ranks for beginning-first grade 
and end-of-klndergarten groups. The norms are based on a national sample 

^Nimnlcht, Glen R. and others, "Interim Report: Research on the New Nursery 
School", Colorado State College, Greeley, Colorado, Dec. 1967. 

O ^Ulireth, G. H., Griffins, N.L. and HcGauvran, "Metropolitan Readiness Tests - 

"Manual olf Directions", Harcort, Brace and World, Inc., Test Department, N. > 
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of 12,231 children. In spite of t’nelr efforts to develop representa- 
tive norms the authors caution that "the f inal standarlzation group may 
be slightly superior to the national average with respect to median In- 
come and average schooling of adults In the communities..." 

III. Reliability of the Tests 

Before test results can be discussed, It Is Important to know how "reli- 
able" the test Is. Reliability Is how consistently a test measures whatever 
it does measure. One way to obtain evidence of test reliability is to cal- 
culate a correlation coefficient between successive administrations of the 
test. 

To some extent the magnitude of the reliability coefficient arrived at 
by correlating successive administrations will depend on the length of time 
between the administrations. If the re-administrations are very close to- 
gether In time, the results may be Influenced by practice. When the time 
between successive administrations Is long, part of the fluctuation In scores 
may be due to different rates of growth or change In whatever the test mea- 
sures . 

To get evidence of the reliability of the HPPSI, IB and C tests used In 
kindergarten classes, correlation coefficients were calculated on successive 
administrations of the tests a month apart. Since PSI scores are transformed 
to percentile scores before the correlations were run, the correlation between 
successive administrators of the PSI Is not reported. 

The correlations calculated on successive administrations one month aoart 
for the HPPSI, IB and C tests are reported In Table I. 



o 
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TABLE I 

CORRELATION COEFFICIENTS CALCUl ATED BETWEEN SUCCESSIVE ADMINISTRATIONS Or THE 
WPPSI, INNOVATIVE BEHAVIOR AND CATEGORIES TEST FOR POOR AND NON -P OOR CHILDRE N 



TEST 


POOR* 




• 


NON-POOR 




N 


r 


N 


r 


WPPSI 


0*2) 


.78 




.77 


IB 


(121) 


.34 


(48) 


.40 


C 


(HD 


.28 


(31) 


-.15 



As shown in Table I the WPPSI has respectable reliability when used with 
poor and non-poor children. The Innovative Behavior and Categories tests, how- 
ever, demonstrate extremely low reliability for both poor and non-poor children. 
This reflects the developmental stage these tests are in along with the limited 
range of total possible scores a child can get on each of them. 

Based on the evidence reported above and on published reliability data on 
the PSI, the WPPSI and PSI are reliable instruments when used with poor and 
non-poor children. As indicated by the extremely low reliability coefficients 
on successive administrations, the Innovative Behavior and Categories tests 
are not reliable. Performance measures resultin: from these two instruments 
are not precise and conclusions based on these test scores should be considered 
tenuous . 

The reliability of the Metropolitan Readiness Test total score is adequate. 
At three different testing sessions on 167, 173 and 200 children, the corrected 
odd-even reliability coefficients for the total test were .91, .91 and .94 
respectively. Reliabilites of subtests were lower and the authors cautioned 
the user when working with subtest scores. 



Throughout this paper, "poor" children are children who meet the Head Start 
Guidelines for poverty as outlined by the Federal Government. 



0 




IV. 



Kindergarten Test Results by Community 

This section gives a short report on the average test scores made by 



9 



poor and non-poor kindergarten children In seven of eight participating 
communities. Communities are Identified by letter and not by name. The 
test results for Community C's kindergarten children are not Included In 
this report since there were testing difficulties and the accuracy of the 
test scores Is questionable. The difficulties In testing are reflected In 
the childrens' changes on the WPPSI. Thirty-five of Community C's 51 poor 
Follow Through kindergarten children made lower WPPSI scores at the end of 
the year than they had at the beginning. Some decreases were as much as 
16 points. 

Also, It should be remembered that reported PSI scores are median oer- 
centiles. Percentile scores were calculated for both poor and non-DOor 
children using middle-class norms. 

Community A 

Table 2 shows the test performance made by children In Community A. 

As shown, both poor and non-poor children made considerable gains on 
WPPSI test. On the average, thirty-six poor children scored about six points 
below the national average Intelligence of 40 at the beginning of the year. 

At the end of the year 33 of the children who were posttested made an Average 
WPPSI score of 40; an average six-point Increase. Verbal gains were slightly 
higher than performance gains. 

Although non-poor children In Community A also made gains on the WPPSI, 
they began the year testing far above the general population. The gains made 
over the six-month period place them approximately one standard deviation 
(10 pts.) above the Intelligence level of children their age. Performance 
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TABLE 2 

AVERAGE PRE, POST AND CHANGES ON THE WPPSI, PSI, IB AND C TEST MADE 
BY POOR AND NON-POOR FOLLOW THROUGH CHILDREN IN COMMUNITY A 



GROUP TEST 


N 


AVG. PRE SCORE 


N 


AVG. POST SCORE 


CHANGE 


POOR WPPSI VERBAL 


(36) 


16.69 


(33) 


19.79 


3.10 


WPPSI PERFORMANCE 


(36) 


17.28 


(33) 


19.70 


2.42 


TOTAL WPPSI 


(36) 


33.97 


(33) 


39.79 


5.82 


PRESCHOOL INV.* 


(59) 


5.00 


(59) 


15.00 


10.00 


INN. BEH. 


(24) 


7.79 


(21) 


7.95 


0.16 


C TEST 


(24) 


3.50 


(21) 


5.52 


2.02 


NON- WPPSI VERBAL 


(37) 


24.14 


(32) 


25.09 


0.95 


POOR 












WPPSI PERFORMANCE 


(37) 


23.32 


(32) 


26.25 


2.93 


TOTAL WPPSI 


(37) 


47.46 


(32) 


50.72 


3.26 


PRESCHOOL INV.* 


(71) 


95.00 


(71) 


95.00 


0.00 


INN. BEH. 


(29) 


13.14 


(25) 


12.04 


-1.10 


C TEST 


(29) 


5.62 


(25) 


7.20 


1.58 



♦Represents median percentile scores. 

on the PSI, reflected by medaln percentile scores, increased for poor child- 
ren. Since the non-poor children recorded such a high median percentile 
score on the pretest, changes for this group were negligible. 

In all cases poor children tested lower at the beginning of the year than 
non-poor children. Note that the non-poor children decreased in Innovative 
Behavior score. 
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Community B 

Test results for Community B are found in Table 3, 

TABLE 3 

AVERAGE PRE, POST AND CHANGES ON THE WPPSI . PS I, IB AND C TEST MADE 
BY POOR AND NON-POOR FOLLOW THROUGH CHILDREN IN COMMUNITY B 



GROUP TEST 


N 


AVG. PRE SCORE 


N 


AVG POST SCORE 


f 

CHANGE 


POOR WPPSI VERBAL 


(90) 


17.01 


(79) 


19.75 


2.74 


WPPSI PERFORMANCE 


(90) 


16, 24 


(79) 


19.19 


2,95 


TOTAL WPPSI 


(90) 


33.26 


(79) 


39.04 


5,78 


PRESCHOOL INV. 


TftT 


10,00 


(74) 


32.50 


22,50 


INN. BEH. 


(91) 


7,39 


(80) 


10,88 


3,49 


C TEST 


(91) 


M2 


(80) 


6.16 


1,74 


NON- WPPSI VERBAL 


(9) 


14.78 


(8) 


18.75 


3,97 


POOR 












WPPSI PERFORMANCE 


(9) 


11,89 


(B) 


16,88 


4,95- 


TOTAL WPPSI 


(a) 


26,67 


(8) 


35,63 




PRESCHOOL INV 


(6) 


2,50- 


(6) 


20.00 


17,50. 


INN. BEH. 


(10) 


7.10 


(8) 


7.88 


0.78 


C TEST 


(10) 


_3.A0_ 


(.81 . 


5.25 


U.85 



As shown in Table 3 the WPPSi gains made by children in Community B were 
substantial. When pretested, the 90 poor children scored about 33 points. 
Posttest poor children Increased by an average of six points. 



WPPSI gains for non-poor children look markedly higher; however, they 
only represent scores made by nine children on the pretest and eight children 
on the posttest. With so few children the average scores could be influenced 
by one or two extremely high scores. 
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Preschool Inventory, Innovative Behavior and Categories test scores also 
increased for both groups. Over the year PSI group scores rose from the 10th 
to the 32nd percentile for the poor children and from the 2nd to the 20th per- 
centile for the few non-poor children. A puzzling thing about Community B 

is that non-poor children were uniformly lower than poor children. 

Community D 

Test results for Community D are found In Table 4. 

TABLE 4 

AVERAGE PRE, POST AND CHANGES ON THE WPPSI , PSI, IB AND C TEST MADE 
BY POOR AND NON-POOR FOLLOW THROUGH CHILDREN IN COMMUNITY D 



GROUP TEST 


N 


AVG. PRE SCORE 


N 


AVG POST SCORE 


CHANGE 


POOR WPPSI VERBAL 


(31) 


16.19 


(31) 


. 17.71 


1 .52 


WPPSI PERFORMANCE 


(30 


18.55 


(31) 


21.23 


2.68 


TOTAL WPPSI 


(31) 


3*1.71 


(31) 


38 94 


4.23 


PRESCHOOL INV. 


(30) 


27.50 


(30) 


90.25 


62.75 


INN. BEH. 


(32) 


7.94 


(30) 


12.27 


4.33 


C TEST 


(32) 


4.91 


(31) 


6.16 


1.25 


NON- WPPSI VERBAL 


(10) 


15.50 


(9) 


16.89 


1.39 


POOR 












WPPS. PERFORMANCE 


(10) 


17.10 


( 9 ) 


20.78 


3.68 


TOTAL WPPSI 


(10) 


32.60 


(9) 


37.67 


5.07 


PRESCHOOL INV 


(9) 


22.50 


(9) 


91.25 


68.71 


INN. BEH. 


03) 


10.77 


03) 


15.15 


4.38 


C TEST 


(13) 


5,15 


03 ) 


6.85 


1.70 



dent. Average Increases in WPPSI scores are 4 points for the poor children 
and 5 points for the non-poor children. Both groups score about the same on 
the measure of Intelligence used in this study. 
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Considerable gains are also made on the Innovative Behavior test as well 
as on the Preschool Inventory. Poor and non-poor children pretested around 
the 25th percentile on the PSI. At the end of the year, the increase in 
test performance placed them above the 90th percentile of middle-class 
children their age. 

Community E 

Table 5 shows the average pre and postest scores for this coimunity. 



TABLE 5 

AVERAGE PRE, POST AND CHANGES ON THE WPPSI, PSI, IB AND C TEST MADE 
BY POOR ANO NON-POOR FOLLOW THROUGH CHILOREN IN COMMUNITY E 



GROUP TEST 


N 


AVG. PRE SCORE 


N 


AVG POST SCORE 


CHANGE 


POOR WPPSI VERBAL 


(57) 


14.14 


(48) 


20.83 


6.69 


WPPSI PERFORMANCE 


(57) 


16.83 


(48) 


21.04 


A. 21 


TOTAL WPPSI 


(57) 


30.97 


(48) 


A 2.77 


11.80 


PRESCHOOL INV. 


(51) 


1 1 .65 


(51) 


23.75 


12. 10 


INN. BEH. 


(57) 


6.35 


m 


8.02 


1.67 


C TEST 


(60) 


A. 90 


(52) 


6.00 


1.10 


NON- WPPSI VERBAL 


(A) 


8.00 


(A) 


19.25 


11.25 


POOR 












WPPSI PERFORMANCE 


(A) 


15.00 


(A) 


22.75 


7-75 


TOTAL V-'OPSI 


(3) 


25.33 


(A) 


A2.00 


16.67 


PRESCHOOL INV 


(5) 


3.33 


(5) 


15.00 


11.67 


INN. BEH. 


(A) 


5.50 


(A) 


6.00 


0.50 


C TEST 


W 


U1 


■ w 


6.25 


2.50 



As shown, when 57 poor Follow Through children in this Community were 
pretested on intelligence, their average score was 31, about one standard 
deviation below the national average. Upon posttesting the average score of 
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48 of these children was 43, slightly above the national average for Intel- 
ligence using this test. This difference represented a chinge of 12 points 
and reflects more of an Increase In the verbal area than It does with Items 
on the performance subtest. 

Although non-poor children recorded an average WPPSI gain of 17 points, 
the gain reflects only three children on the pretest and four on the posttest. 

Innovative Behavior percentile scores made at the end of the year by 49 
poor children were high and reflected a gain of two points from the beginning 
of the year. Gains on the PSI are relatively modest. 

Community F 

Test results for children in Community F are found in Table 6. 



TABLE 6 

AVERAGE PRE, POST AND CHANGES ON THE WPPSI, PSI, IB AND C TEST MADE 
BY POOR AND NON-POOR FOLLOW THROUGH CHILDREN IN COMMUNITY F 



GROUP TEST 


N 


AVG. PRE SCORE 


N 


AVG POST SCORE 


CHANGE 


Kf lWJ U 1 

POOR WPPSI VERBAL 


(20) 


16.95 


(18) 


18.67 


1.72 


WPPSI PERFORMANCE 


(20) 


17-40 


08) 


19.06 


1.66 


TOTAL WPPSI 


(20) 


34.35 


(18) 


36.39 


2.04 


PRESCHOOL INV. 


(H) 


14.25 


(11) 


22,50 


8.25 


INN. BEH. 


(20) 


7.80 


(17) 


7.94 


0.14 


C TEST 


(20) 


3.90 


(17) 


5-35 


1.45 


NON- WPPSI VERBAL 


(21) 


19.95 


(20) 


18.90 


-1.05 


POOR 












WPPSI PERFORMANCE 


(21) 


19.05 


(20) 


20 . 20 


1.15 


TOTAL WPPSI 


(21) 


39.00 


(20) 


38.10 


-0.90 


PRESCHOOL INV 


(15) 


55.00 


(15) 


82.50 


27.50 


INN. 8EH. 


(23) 


7.83 


(21) 


8.57 


0.74 


C TEST 


(23) 


4.00 


{ 21 L 


5.2J 


UZ-1 
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As Table 6 shows > the poor Follow Through children In Community F in- 
creased their average scores on all measures. The MPPSI scores Increased 
on the average about two points and resulted in an average posttest score 
of 36 for the 18 children tested. A score of 36 is below the average for 
the general population of children. 

The non-poor children scored about the same on the two administrations 
of the WPPSI. The average total WPPSI score for the 20 non-poor children 
tested at the end of the year Is about one point lower than the average 
score for these same children at the beginning of the year. The median 
PSI score for the poor and non-poor children Increased over the year. 

With both groups of children, average scores made on the Categories 
test Increased approximately two points from the beginning to the end of 
the year while the Innovative Behavior Test recorded negligible gains. 
Community G 

Results for Community 6 are found in Table 7. 



TABLE 7 

AVERAGE PRE, POST AND CHANGES ON THE WPPSI, PSI, IB AND C TEST MADE 
BY POOR AND NON-POOR FOLLOW THROUGH CHILOREN IN COMMUNITY G 



■GROUP 


TEST 


N 


AVG. PRE SCORE 


N 


AVG POST SCORE 


j CHANGE 


POOR 


WPPSI VERBAL 


(69) 


18.00 


(63) 


19.92 


1 .92 




WPPSI PERFORMANCE 


(69) 


17.68 


(63) 


20.27 


2.59 




TOTAL WPPSI 


(69) 


35.5*1 


(63) 


40.19 


4.65 




PRESCHOOL INV. 


(55) 


50.00 


(55) 


65.00 “ 


15.00 




INN. BEH. 


(76) 


5.90 


(65) 


7.71 


1 .81 




C TEST 


(76) 


4.61 


(62) 


5.73 


1.12 


NON- 


WPPSI VERBAL 


(33) 


17.91 


(31) 


20.77 


2.86 


POOR 










WPPSI PERFORMANCE 


(33) 


19.36 


(3!) 


21.84 


2.48 




TOTAL WPPSI 


(32) 


36.66 


(31) 


42.61 


5.95 




PRESCHOOL INV 


(32) 


72.50 


(32) 


85.00 


12.50 


0 

ERIC 


INN. BEH. 


(35) 


5.71 


(33) 


7.09 


1.38 




r TFST 


MO 


v A . ?6 • 




« 7ft 


ft Lit 
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Intelligence scores In Community G were an average five points higher 
at the end of the school year for approximately 100 poor and non-poor child- 
ren. This Increase resulted In an average posttest WPPSI score of 43 for 
the non-poor children and 40 for the poor children. PSI median scores also 
Increased about equally for both groups as did scores made on the Innovative 
Behavior subtest. 

Both poor and non-poor children made slight gains on the Categories Test, 
Community H 

There were no non-poor Follow Through kindergarten children In Community 
H. Test results for poor kindergarten children at this site are found in 

Table 8. 



TABLE 8 

AVERAGE PRE, POST AND CHANGES ON THE WPPSI, PSI, IB AND C TEST NADS 
8Y POOR FOLLOW THROUGH CHILOREN IN COMMUNITY H 



GROUP TEST 


N 


AVG. PRE SCORE 


N 


— i 

AVG POST SCORE 


CHANGE 


POOR WPPSI VERBAL 


(24) 


22.00 


(26) 


23.42 


1.42 


WPPSI PERFORMANCE 


(24) 


16.75 


(26) 


17.81 


1.06 


TOTAL WPPSI 


(24) 


38.75 


(26) 


41.23 


2.48 


PRESCHOOL INV. 


TT5T 


11.00 


<i6) 


2V. 00 ' 


13.00 


INN. BEH. 


(29) 


5.07 


(23) 


9.30 


4.23 


C TEST 


(30) 


3,77 


(23) 


7.39 


3.62 


^ „ . j 





. . 




- • - 





Poor children in Community H also demonstrated positive changes on all 
measures. The WPPSI scores Increased on the average approximately 3 points. 
The 26 children posttested on the WPPSI at the end of the year score slightly 
above the national average on this test. 



O 
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Similar Increases over the school year were made on the PSI. Innovative 
Behavior and Categories test scores showed large Increases. 

Su mmary of Kindergarten Test Results 

Since the WPPSI Is becoming one of the most reputable national instru- 
ments for measuring Intellectual capacity In children aged four to six and 
a half, It seems appropriate to summarize how well poor and non-poor children 
In Responsive Environment classrooms performed on the four subtests of this 
Instrument. Table 9 does this. 

TABLE 9 

A SUMMARY OF TOTAL WPPSI PRE- AND POSTTEST SCORES FOR POOR AND NON-POOR CHILDREN 

IN SEVEN COMMUNITIES 



Communi ty 


Group 


M 


Pre Score* 


M 


Post Score* 


Change 


A 


poor 


(36) 


34 


(33) 


40 


6 




non-poor 


(37) 


47 


(32) 


51 


A 


B 


poor 


(90) 


33 


(79) 


39 


6 




non-poor 


(9) 


27 


(8) 


36 


9 


D ' 


poor 


(31) 


35 


(31) 


39 


4 




non-poor 


(10) 


33 


(9) 


38 


5 


E 


poor 


(57) 


31 


(48) 


43 


12 




non-poor 


(3) 


25 


(4) 


42 


17 


F 


poor 


(20) 


34 


(18) 


36 


2 




non*poor 


(21) 


39 


(20) 


38 


-1 


G 


poor 


(69) 


36 


(63) 


40 


4 




non-poor 


(32 


37 


(31) 


43 


6 


H 


poor 


(24) 


39 


(26) 


41 


2 




non-poor 


(non 


aval lable) 








TOTAL 


poor 


(327) 


34 


(298) 


40 


6 


(weighted) 


non-poor 


(112) 


39 


(104) 


43 


4 


* Average 


pre and pos 


ttest 


scores were 


rounded to 


the nearest 


whole number. 



As Indicated In Table 9, average WPPSI Intelligence scores made by poor 



groups of children Increased In every comnunlty. The average Increase for 

O 
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poor children in all communities was six points (see Figure 1). 




average posttest WPPSI scores for 
poor and non-poor Follow Through 
chi Idren. 



Poor children on the average scored 34 at the beginning of the year. 

This was slightly below the national average level of Intelligence. At the 
end of the year-long kindergarten experience, their Intelligence scores were 
equivalent to the national average of 40. 

Non-poor children in the Responsive Environment classrooms made less 
change In WPPSI scores throughout the year. These children, however, began 
the school year at a higher level of intelligence as measured by the WPPSI. 
One hundred and twelve non-poor children made an average score of 39 in the 
WPPSI. The posttest average score of 104 of these children was 43. 

There was considerable variability on WPPSI group changes made by poor 




and non-poor children in different communities. As shown In Table 9 and 
Figure 2, the poor and non-poor groups in Community E Increased 12 and 17 
WPPSI points respectively. 
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Figure 2. Changes between pretest and posttest poor 
and non*poor Follow Through groups in 
seven commun i t I es . 

In Community B, where children pretested about the same as children in 
Community E group changes were on the average 6 points for the poor group and 
9 points for the non-poor group. At only one site, Community F, there was a 
decrease in WPPSI scores from pretest to posttest. This decrease was one 
point for non-poor children. 

Providing a general overall summary of performance by kindergarten child- 
ren on the Preschool Inventory Is difficult. A kindergarten child's score 
on the PSI was transferred to a percentile score using middle-class norms 
for both poor and non-poor children. Median percentile scores were then re- 
ported for poor and non-poor groups In each community. Because test pro- 
tocols were returned to their respective communities, raw PSI scores could 
not be retrieved and analyzed to provide a more comprehensive summary. 



O 
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However, Table 10 shows the median PSI percentile score for poor and 
non-poor children in the kindergarten Follow Through communities. 

TABLE 10 

MEDIAN PRESCHOOL INVENTORY PRE- AND POSTTEST SCORES AND CHANGES FOR POOR AND 
NON-POOR CHILDREN IN SEVEN COMMUNITIES USING MIDDLE CLASS NORMS 



COMMUNITY 


GROUP 


N 


PRETEST MDN . SCORE 


POSTTEST MDN. SCORE 


MEDIAN INCREASE 


A 


Poor 


59 


5.00 


15.00 


+10.00 




Non-Poor 


71 


95.00 


95.00 


0.00 


6 


Poor 


7A 


10.50 


32.50 


+22.50 




Non-Poor 


6 


2.50 


20.00 


+17.50 


D 


Poor 


30 


27.50 


90.25 


+62.75 




Non-Poor 


9 


22.50 


91.25 


+68.75 


E 


Poor 


51 


11.65 


23.75 


+12.10 




Non-Poor 


5 


3.33 


15.00 


+11.67 


F 


Poor 


11 


IA.25 


22.50 


+ 8.25 




Non-Poor 


15 


55.00 


82.50 


+27.50 


G 


Poor 


55 


50.00 


65.00 


+15.00 




Non-Poor 


32 


72.50 


85.00 


+12.50 


H 


Poor 


16 


11.00 


2<«.00 


+13.00 




Non-Poor 


A 








TOTAL 


Poor 


296 


11.80 


26.10 


+ H.30 


Non-Poor 


138 


22.50 


72.50 


+50.00 




♦There were 


no non- 


poor Follow Through children in Community H. 


— * 



As shown, median scores increased approximately 14 percentile points for 



296 poor children and 50 points for the 138 non-poor children. When ana1y2ed 
by communities, it will be noticed that some groups, such as the poor and non- 
poor in Community D, showed considerable Increase in median percentile scores. 
Another group, the non-poor in Community A, because the scores were so high 
on the pretest, showed no median Increase. 
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In Communities 8 and E the posttest PSI median scores was hiqher for 
the poor group. No conclusion can be made however, because of the few 
children In the non-poor groups In these two communities. 

V . F irst Grade Test Results 

Two communities had first grade Follow Through classes In 1968. As 
was discussed, the Metropolitan Readiness Test was administered to all 
children In each of these communities at the beginning and end of the year 
Test scores were not reported separately for economic levels, since almost 
all children In Communities I and 0 are poor. Table 11 shows the average 
Metropolitan raw scores for first graders In these communities. 

TABLE 11 

FIRST GRADE METROPOLITAN PRE-AND POSTTEST RAH SCORES 








Average Scores 






Communities 


N 


Pretest 


Posttest 


Change 


I 


293 


39.63 


66.10 


26.57 


0 


124 


41.15 


?1 .96 


30.81 


Total (weighted) 417 


40.88 


67.84 


27.76 



As Indicated bbove, first grade children In each community performed! 
relatively low on the pretest. The children scored on the average about 
40 out of the 102 total possible score. A score of 40 corresponds to the 
23rd percentile rank for children In the standarizatlon sample. According 
ot the test developers a score of 40 would categorize a child as havlnq a 
"Low Normal readiness status". That Is, that child Is "likely to have dif- 
ficulty In first qrade work" and "should be asslngned to a slow section and 
given more Individualized help." This low score Is understandable knowing 
8$X of the children In Coraunttles I and 0 are children from disadvantaged homes. 
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VI. 




the end of their first grade experience the children scored con- 
siderably higher on the Metropolitan. The average posttest raw score for 
the 417 first graders tested was 68. A score of 68 falls at the 77th 
percentile on norms for beginning first grade children. However, consider- 
ing the low socio-economic status of the children in Community I and 0, 
and the fact that the Metropolitan norming group was somewhat biased to- 
ward higher income families, it Is reasonable to conclude that first graders 
in Communities I and 0 would compare very favorably to norms calculated at 
the end of the first grade experience if they were available. 

A child with a raw score of 68 would be classified by the test developers 
(at the beginning of the first grade experience) as having "high normal readi 
ness status. 11 This child would be characterized as having "good orospects 
for success in first-grade work provided other Indications, such as health, 
emotional factors, etc. are consistent." 

Overview 

This paper Is a preliminary report of objective test results made by ap- 
proximately 300 poor and 100 non-poor kindergarten children and by 417 first 
grade children during 1968-1969. The test results were used to measure 
changes in a child's intellectual ability as a result of one year's schooling 
in a Responsive Follow Through classroom. Except for four Instances, average 
test scores made by groups of both poor and non-poor kindergarten Follow 
Through children Increased for all tests in every community. That Is, over 
the seven to nine month time period, kindergarten children In all communities 
Increased in intelligence, the ability to form concepts, the ability to solve 
problems and the ability to categorize. Results for first grade children 
were also encouraging. There was considerable Increases In Metropolitan test 
scores for both first grade communities from the beginning to end of the 

school year. 
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The test results reported in this paper were summarized by grade within 
the community. For each community of kindergarten children, the performance 
of children classified as poor was separate from performance made by non- 
poor children. These breakdowns reflect the least number of ways the test 
results had to be categorized before they could be meaningful; but this Is 
a limitation. 

There are other ways to summarize children's scores. For example, a 
more Important question could be answered by determining children's per- 
formance as related to their year-long experiences In a soeclflc Responsive 
classroom. To answer this question, the degree to which the Responsive 
model has been carried out in each classroom would first be determined. 

Since we already know that in different communities poor and non-poor child- 
ren score differently on the tests, the analysis would then take into con- 
sideration variable of grade, community, poor vs. non-poor, and the degree 
to which the model has been implemented by a teacher and her assistant. 

Using the results obtained by the Classroom Observation Schedule , an 
index of the degree of "responsiveness" will be figured for each classroom. 
This variable will then be correlated with test performance. Other analyses 
must also be made in examining the effectiveness of the Responsive model. 

For example, questions must be answered dealing with the model's effective- 
ness with different ethnic groups, for different sexes, or for children 
where the primary language in the home is not English. Biographic and demo- 
graphic information provided on the Child Data Sheet will help us answer 
these questions. .. 
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.Undoubtedly the major questions that must eventually be resolved are 
concerned with the long-term effects of the program and hoi children In the 
Responsive Follow Through Model compare with children In other Follow 
Through models or In conventional classroom settings. These questions must 
take Into account attitude toward school, self-concept related to school, 
along with concept formation, and problem-solving abilities. Answers to 
these questions will Influence not only the nature of early childhood edu- 
cation, but the nature of all education. For this reason, evaluation In 
the Responsive Model must continue to be an Integral part of the program. 
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